Talking your way around a conference: a speech interface for remote equipment control

نویسندگان

  • Anuj Gujar
  • Shahir Daya
  • Jeremy R. Cooperstock
  • Koichiro Tanikoshi
  • William Buxton
چکیده

Videoconferencing enables people to attend and participate in meetings from remote locations. The key problem faced by electronic attendees is the limited sense of engagement offered by the audiovisual channel. The attendee is typically restricted to a single view of the room and has no ability to interact with presentation technology at the conference site. As a first step to improving the situation we want to assign electronic attendees a view of the room appropriate to their particular “social roles,” which may include presenting a topic, listening to a talk, or participating in a discussion. However, attendees may change roles during a meeting, thus requiring a different position and view more suited to the new role. This involves switching video inputs and outputs to new cameras and monitors. One possible method to enable video attendees to effect these changes independently is to provide them with the same graphical user interface (GUI) that the central site has to control the equipment. Unfortunately, using state-of-the-art systems for such control is often confusing and complex. Furthermore, this solution requires the attendees to have “extra” computer equipment (i.e. equipment not already required for videoconferencing) and learn how to operate the GUI. Instead, using speech recognition and video overlay technologies, we are able to provide a non-technical interface to equipment in the meeting room. In doing so, we do not require any extra equipment at the attendees’ sites. Our approach provides attendees with the means of controlling their own view of the meeting, changing electronic seats, and manipulating equipment remotely, all through simple voice commands.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DICIT: Evaluation of a Distant-talking Speech Interface for Television

The EC-funded project DICIT developed distant-talking interfaces for interactive TV. The final DICIT prototype system processes multimodal user input by speech and remote control. It was designed to understand both natural language and command-and-control-style speech input. We conducted an evaluation campaign to examine the usability and performance of the prototype. The task-oriented evaluati...

متن کامل

Remote Access of Computer Controlled Experiments

in this paper, we present a way for students to access and operate laboratory equipment, controlled by a laboratory computer via a remote access program. In this way, the solution is not dependent on the specific laboratory equipment, as long as the equipment can be remotely controlled. The system can easily be altered to be used in another laboratory setup. Students are able to make reservatio...

متن کامل

Remote control for videoconferencing

We have designed, implemented, and deployed a camera control system and a conference controller that provide remote control capabilities for videoconferencing over the Internet. The camera control system allows users to pan, tilt, and zoom the cameras, switch between cameras, and get a picture-in-picture view from their desktops. The conference controller allows conference participants to not o...

متن کامل

Voice Operated Guidance Systems for Vision Impaired People - Investigating a User-Centered Open Source Model

People who have impaired vision regularly use white canes and/or guide dogs to assist in obstacle avoidance. Guide dogs can also be of limited assistance for finding the way to a remote location, known as “wayfinding”. Several electronic devices are currently available for providing guidance to a remote location, but these tend to be expensive, or make use of a Braille interface. This project i...

متن کامل

A Text to Visual Speech Instant Messaging System

This paper describes the implementation of text-to-visual-speech instant messaging system using the Remote Method Invocation (RMI) and graphics functionality of Java, together with synthetic speech via the Microsoft Speech API. Our system allows users to communicate over a low-bandwidth network connection using text that is converted into a realistic talking face. The avatar of each user consis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995